Figure 1: 

Comparison of the predicted amino acid sequence of the T. aureum probe 'TA-PKS-1- 
consensus' and the homologous region on ORF A of Schizochytrium PKS gene cluster 
(Accession number AAK72879). 



Quality: 1269 Length: 525 

Ratio: 2.469 Gaps: 10 

Percent Similarity: 61.690 Percent Identity: 52.849 

Match display thresholds for the alignment (s) : 
I = IDENTITY 
: = 2 
. = 1 

TA-PKS-l-consensus .pep x aak7 2 87 9 . genpept . . 



1 LCKTLDLEWPH. . VFARS IDIELGANEETAAQAIFEELSCPDLTVREAGY 48 
1111:111 I I . I : I I I . I I ! I I I : . I | : : | | | 

2277 LCKTIGLEWSESDVFSRGVDIAQGMHPEDAAVAIVREMACADIRIREVGI 2326 

• • • . • 

4 9 TKDGKRWTTEARPVGLGKPKQALRSSDVFLVSGGARGITPVCVRELAKSI 98 
..II I • II..: I I I I I I I II I I I I . I : I I : : I 
2327 GANQQRCTIRAAKLETGNPQRQIAKDDVLLVSGGARGITPLCIREITRQI 237 6 

99 SGGTFVLLGRSPL . ADDPAWACGV . EEANIGTAAMAHLKAEFAAGRGPKP 146 
. I I : : I I II I • I : I I I I : : I : II II I . I I I I I I 
2377 AGGKYILLGRSKVSASEPAWCAGITDEKAVQKAATQELKRAFSAGEGPKP 2426 

147 T PKAHKALVG S VLGARE VLG S LE S I RAQGARAE YVS C DVS C AERVKAWD 196 
I I : I II I I I I I I I I I I : . I I I : I I I I I I . I I I 
2427 TPRAVTKLVGSVLGAREVRSSIAAIEALGGKAIYSSCDVNSAADVAKAVR 2476 

• • . • . 

197 DLERRVGA . VTGWHASGVLRDKSVERLELADFEWYGTKVDGLLNLLQA 245 
I I ..II I - I : I I I I I I I II : : I : : I : I : I I I I II I I I I 

24 77 DAE S QLGARVSG I VHAS GVLRDRL I EKKLPDE FDAVFGTKVTGLENLLAA 252 6 

24 6 VDRPKLRHLVLFSSLAGFHGNTGQAVYAMANEALNKMAFHLETAMPGLSV 2 95 
III I : I : I I I I I I I I I I I I . I I • I I I I I I I II I I III .11 
2527 VDRANLKHMVLFSSLAGFHGNVGQSDYAMANEALNKMG . . LELA . KDVSV 2573 

296 KTIGFGPWDGGMVNDALKAHFASMGVQIIPLDGGAETVSRIIGACSPTQV 345 
I • I I I I I I I I I I II I I I I I I I I : II I : I I - I.I : II:: 
2574 KS I CFGPWDGGMVTPQLKKQFQEMGVQ 1 1 PREGGADTVARI VLGSS PAE I 2 623 



346 LVGNWGLPPWPNASVHKITVRLGGESANPFLSSHTIQGRKVLPMTXALG 395 
I I I I I I . : : : .11111 I I I I I : I I I I I I : I 

2 624 LVGNWRTPSKKVGSDTITLHRKISAKS . NPFLEDHVIQGRRVLPMTLAIG 2 672 



396 LLAE AARGL YVGHQ VXG I E DAQVFQGWLDKGATCEVQLRRE S S T AS P SE 4 45 
III I I : I : . I : I I I . I . I I . I II I I III 

2673 SLAETCLGLFPGYSLWAIDDAQLFKGVTVDGDVNCEVTL; .TPSTAPSGR 2720 

• * . . • 

44 6 WLSASLNVFAAGKWPAYRAHWLGASGPRTGGVQLELKDLGVDADPAC 4 95 
1*1.1 I . . I I . I I I I I I : I I I : : I I I I I I 

2721 VNVQATLKTFSSGKLVPAYRAVIVLSNQGAPPANATMQPPSL. . DAD PAL 2768 

• • 

4 96 S VGKG AL Y DGRTLF HG P AFQ YMDEV 520 

. I . . I I I : I I I I I I I I . .1:1 
27 6 9 . . . QGSVYDGKTLFHGPAFRGIDDV 27 90 



Figure 2: 

Comparison of the predicted amino acid sequence of the T. aureum probe 'TA-PKS-1- 
consensus' and the homologous region on ORF 5 of Shewanella PKS gene cluster 
(Accession number AAB81 123). 

Quality: 641 Length: 551 

Ratio: 1.233 Gaps: 16 

Percent Similarity: 47.379 Percent Identity: 39.919 

Match display thresholds for the alignment (s) : 
I = IDENTITY 
: = 2 
. = 1 

TA-PKS-l-consensus .pep x aab81123 . genpept 

• • . . 

1 LCKTLDLEWPHVFARSIDIELGANEETAAQAIFEELSCPDLTVREAGYTK 50 

I 1 II II I I I I.: I I . I II II -II. 

20 94 LTKTLSHEWPQVFCRALDIATDVDATHLADAITSELFDSQAQLPEVGLSL 2143 
• 

51 . DGK. . RWTTEARPVGLGKPKQALRSSDVFLVSGGARGITPVCVRELAKS 97 
Mi ll I I I I - I I I . I I I : I : I I Ml 

2144 IDGKVNRVTLVAAEAADKTAKAELNSTDKILVTGGAKGVTFECALALA.S 2192 
* • • • . 

98 I S GGTFVLLGRS PLADDPAWACGVEE AN I GTAAMAHLKAEFAAGRGPKPT 147 
I 1:11111 I . I I I : . : . I I . | | : | | | | 

2193 RSQSHFILAGRSELQALPSWAEGKQTSELKSAAIAHI ISTGQKPT 2237 

• 
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II -II I • I : . I . I I I I I I I I I . . : | 

2238 PKQVEAAVWPVQSSIEINAALAAFNKVGASAEYVSMDVTDSAAITAA. . . 2284 
• 
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I I : I I . : I . I I I I I : : I I : I I I I M . I I | | | . : 
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• 
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I : : I : I I I I I I : I I I I . I I I . I : I I I I I I . 
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■ I I I I I I I I I : I I | Ml . : | . I.I: 

2385 FNWGPWDGGMVNPALKKMFTERGVYVIPLKAGAELFATQLLAETGVQLLI 2434 

348 G NWG. . LPPWPNASVHK . .-. . .IT.VRLG 369 

I I I I I I I : | | | 

24 35 GTSMQGGSDTKATETASVKKLNAGEVLSASHPRAGAQKTPLQAVTATRLL 24 84 
- 

37 0 GESANPFLSSHTIQGRKVLPMTXALGLLAEAARGLYVGHQVXGIEDAQVF 419 

II I : III M I I : : I | I : . I I I : I . . 



2485 TPSAMVFIEDHRIGGNSVLPTVCAIDWMREAASDM . LGAQVK . VLD YKLL 2532 



420 QGWLDKGATCEVQLRRESSTASPSEWLSASLNVFAAGKWPAYRAHW 469 

- I : I : I - I I - I II:. I : I I : I • : 
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. : 1:1 .1.111 II II I I I . 

2577 SDNADIKQLNKQFDL SAKAITTAK . ELYSNGTLFHGPRLQGIQS 2619 

520 V 520 
I 

2620 V 2620 



Figure 3: 

Organization of PUFA-PKS genes from Thraustochytrium aureum ATCC 34304 
ORF A- 8748 bp 
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KS= (3-keto acyl synthase 
MAT= MalonylCoA 
transferase 

ACP= Acyl carrier protein 
KR= Ketoacyl-ACP reductase 
AT= Acyl transferase 
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FIGURE 4 

Sequence ID Nos. and Corresponding Sequences: 
SEQ ID NO 1: 

5'-AGC GGA TAA CAA TTT CAC ACA GG-3' 
SEQ ID NO 2: 

CACGAGGCCAAGCATTCGAGCAAAGCGCTCAACCAGCAGATCCCAGG 

CGGGCGCGCCTGCTTCGTGGGCGTCTCGCGAATCGACGGACAGCTCG 

GACTTAGCGGAGCTTGCGCGAAAGGAAAGGGCTGGGCTGAGGCCGCA 

GAGATTGCTCAGCAAGGAGCCGTCGCAGGCTTGTGCAAGACCTTGGA 

CCTAGAGTGGCCGCACGTCTTCGCTCGCAGCATCGACATCGAGCTTGG 

CGCGAACGAAGAAACAGCTGCGCAAGCAATCITTGAGGAGCTCTCTT 

GCCCGGACCTAACGGTGCGCGAAGCAGGATACACCAAAGACGGCAA 

GCGGTGGACGACTGAGGCGCGACCGGTTGGGCTTGGCAAGCCCAAGC 

AGGCACTACGTTCTTCGGACGTCTTCTTGGTTTCTGGTGGGGCGCGGGG 

AATTACACCTGTTTGCGTTCGCGAGTTGGCCAAATCGATCAGTGGTGG 

CACTTTTGTCCTCCTCGGGCGGTCCCCTCTCGCTGATGATCCGGCGTGG 

GCTTGCGGCGTCGAGGAAGCAAACATTGGGACAGCCGCTATGGCGCA 

CCTCAAGGCCGAGTTCGCAGCCGGGCGCGGCCCGAAGCCGACGCCAA 

AGGCCCACAAAGCACTCGTTGGGAGCGTCCTGGGGGCGCGCGAAGTC 

CTTGGTTCGCTAGAGAGTATTCGCGCCCAGGGTGCGCGCGCCGAGTAC 

GT 

SEQ ID NO:3: 

TCGCCAACACAAGTTCTGGTTGGCAACTGGGGCTTGCCCCCTGTAGTT 
CCTAACGCGAGCGTGCACAAGATTACTGTGAGGCTTGGCGGGGAGTC 
TGCAAACCCTTTCCTGTCCTCCCACACGATTCAAGGCAGAAAGGTCTT 



GCCGATGACTGYGGCGCTTGGGCTTCTCGCTGAGGCGGCTCGAGGGCT 

CTACGTCGGTCACCAAGTAGYCGGGATTGAGGACGCCCAAGTCTTCCA 

GGGAGTCGTGTTGGACAAAGGGGCGACGTGTGAGGTCCAGCTTCGCC 

GCGAGTCTTCGACTGCAAGCCCAAGCGAGGTTGTGCTGAGTGCTTCGC 

TCAATGTATTCGCGGCGGGAAAGGTTGTGCCTGCGTACCGCGCGCATG 

TCGTGCTCGGCGCTTCAGGGCCACGCACTGGCGGCGTGCAGCTTGAAC 

TGAAAGATTTGGGCGTGGACGCCGACCCTGCTTGCTCCGTTGGCAAGG 

GTGCGCTGTACGACGGTAGGACGCTGTTCCATGGGCCGGCGTTTCAGT 

ACATGGATGAGGTTCCCTGGTGCTCGCCTGCAGAGCTTGCCGTGCGGT 

GCCGTGTCGTTCCGAGCGCGGCTCAGGACCGCGGCCAATATGTTTCGC 

GCGGAGTGTTGTACGACCCGTTCCTGAACGACACGGTGTTTCAAGCTC 

TCCTTGTTTGGGCCCGTCTGGTCAGGGACAGCGCTTCGCTACCGAGCA 

ACGTTGAACGAATCTCGTTCCACGGCCAGCCGCCGAGCGAGGGCGAG 

GTGTAGTACACCACGCTCAAGCTGGACAGTGCTGCGAGCGGGCCGCT 

CGACCCGATTGCAACAGGCGCATTTCTTCCTCCACCGAGCTTGCGGGG 

CGGTCTTTGCATCAGGGCGAGCGAGTGTGGTTCTGAACAAGGCTCTTT 

CGTATGATGGCTCTCGACCCAAAGGCGAGTAGAGTACTCTACTCAGTA 

CTCCTTTTCACATACCGGCAGGCAGCGTTGCTGTGGGATGGCCGGGGG 

CTCTTCTGCACGCGGCTCC 

SEQ ID NO: 4: 

GAATTCGGCACGAGGCCGGCCTCACGACGCAGGTTGTTCGGTTCCGCG 

CTGCAGGTCTGTCACGCAACGCGGACGGCTCTGTTCGAGTCCGCAACC 

GCATCATCGGAAAGATTTCGCGCACGGAGCTCGCGGAGATGTTCATTC 

GCCCCGCTCCGGAGGCCCTCTTGACCAAGTTGGTTGCGTCGGGTGAGA 

TTTCGGCCGAGCAGMNGCCTGGCCAAACAAGTGCCGATGCCGACGAC 

ATTGCCGTCGAGSAGAACTCGGGCGGCCACACGGACAATCGCCCGAT 

CCATGTCATCCTTCCGCTGATCATCGCGCTCCGCAACAGGCTGCACAA 

GGAGTGCGGTTACCCGGCGAGCCTTCGCGTTCGAGTTGGCGCGGGTGG 



CGGGATCGGCTGCCCGCTTGCAGCAACTGCGGCCTTCAACATGGGCGC 

CGCCTTTCTCGTGACAGGAACAGTCAACCAACTCAGCCGGCAGTCGG 

GCACCTGCGACGCGGTGCGCATSAGCTTTTCAAAAGCGACCTACTCGG 

ACATCACAATGGCGCCCGCCGCAGATATGTTTGACCAGGGGGTTGAG 

CTCCAGGTGCTCAAGAAGGGCACCATGTTTCCGTCGCGCGCCAAGAA 

GCrCTACGAGCTGTTTTGCACGTACAACTCGKTCGACGAGATGCCCGC 

CGAGGAGCTCGCGCGGGTTGAGAAGCSGATYTTCCAAAAGCCCCTCG 

CGGSCGTATGGGACGAGACGAAAGACTTTTACATCAACCGTCTCCACA 

ACGAGGACAAGATCGAACGCGCAGAAAAGGATGGCAAGCTCAAGAT 

GTCGCTCTCGTTCCGCTGGTACCTTGGCCTGAGTTCGTTCTGGGCCAAC 

AATGGAATCGCCGACCGCGTGCTGGACTATCAAGTGTGGTGCGGCCCT 

GCGATTGGGGCCTGGAACGACnTGCCAAGGGATCCTACCTCGACGCC 

GAGGTCTGCGGCCAGTTTCCTTGCGTTGTGCAGGTCAACCTGCAGATC 

CTCCACGCGCGGCCTACATGCAGCGCCTTCTGGCCGTCAAGCATGACC 

CGCGCATCGAGTTTGACCTCGAGGACCCGGTCTTTGGTACGCCCCCAC 

TGCCGCGCTCTAAAGCGATGCAGCAACGCACTCTTTCGGAGGCCCGTC 

GCTGCAGCACTTGTGCGAACTCGATAGGGTTTCTTTCAAGATTTCAATC 

AACAAAACAAGTATTGGAATGACAAAAAAAAAAAAAAAAACTCGAG 

SEQ ID NO:5: 

5'- CTT GTG CAA GAC CTT GGA CCT AGA G-3' 
SEQ ID NO:6: 

5'-GAA CCT CAT CCA TGT ACT GAA ACG C-3' 
SEQ ID NO:7: 

TTGTGCAAGA CCTTGGACCT AGAGTGGCCG CACGTCTTCG 
CTCGCAGCATCGACATCGAG CTTGGCGCGA ACGAAGAAAC 
AGCTGCGCAA GCAATCTTTGAGGAGCTCTC TTGCCCGGAC 



• « 



CTAACGGTGC GCGAAGCAGG ATACACCAAAGACGGCAAGC 
GGTGGACGAC TGAGGCGCGA CCGGTTGGGC TTGGCAAGCC 
CAAGC AGGC A CTACGTTCTT CGGACGTCTT CTTGGTTTCT 
GGTGGGGCGCGGGGAATTAC ACCTGTTTGC GTTCGCGAGT 
TGGCCAAATC GATCAGTGGTGGCACTTTTG TCCTCCTCGG 
GCGGTCCCCT CTCGCTGATG ATCCGGCGTGGGCTTGCGGC 
GTCGAGGAAG CAAACATTGG GACAGCCGCT ATGGCGCACC 
TCAAGGCCGA GTTCGCAGCC GGGCGCGGCC CGAAGCCGAC 
GCCAAAGGCCCACAAAGCAC TCGTTGGGAG CGTCCTGGGG 
GCGCGCGAAG TCCTTGGTTCGCTAGAGAGT ATTCGCGCCC 
AGGGTGCGCG CGCCGAGTAC GTTTCCTGCGACGTTTCGTG 
TGCGGAGCGC GTCAAGGCCG TCGTCGACGA TCTCGAGCGA 
CGGGTCGGGG CTGTAACTGG GGTTGTGCAC GCCTCTGGTG 
TTCTCCGAGACAAGTCCGTT GAGCGCTTGG AGCTCGCCGA 
CTTCGAGGTC GTGTACGGCACC AAGGTGGA CGGCCTGCTC 
AACCTGCTGC AGGCCGTGGA CCGCCCCAAACTCCGGCACT 
TGGTCCTCTT CAGCTCCCTG GCCGGTTTCC ACGGCAACAC 
TGGGCAGGCC GTGTACGCTA TGGCGAATGA GGCGCTGAAC 
AAGATGGCCTTCCATTTGGA AACTGCGATG CCTGGCCTCT 
CGGTCAAGAC GATCGGGTTTGGACCTTGGG ACGGCGGCAT 
GGTCAACG AT GCGCTG AAAG CGCACTTTGCGTCTATGGGC 
GTCCAAATTA TTCCGCTCGA CGGYGGCGCG G AGACCGTTT 
CCCGAATCAT CGGGGCGTGC TCGCCAACAC AAGTTCTGGT 
TGGCAACTGGGGCTTGCCCC CTGTAGTTCC TAACGCGAGC 
GTGCACAAGA TTACTGTGAGGCTTGGCGGG GAGTCTGCAA 
ACCCTTTCCT GTCCTCCCAC ACGATTCAAGGCAGAAAGGT 
CTTGCCGATG ACTGYGGCGC TTGGGCTTCT CGCTGAGGCG 
GCTCGAGGGC TCTACGTCGG TCACCAAGTA GYCGGGATTG 
AGGACGCCCAAGTCTTCCAG GGAGTCGTGT TGGACAAAGG 



GGCGACGTGT G AGGTCCAGCTTCGCCGCG A GTCTTCGACT 
GCAAGCCCAA GCGAGGTTGT GCTGAGTGCTTCGCTCAATG 
TATTCGCGGC GGGAAAGGTT GTGCCTGCGT ACCGCGCGCA 
TGTCGTGCTC GGCGCTTCAG GGCCACGCAC TGGCGGCGTG 
CAGCTTGAACTGAAAGATTT GGGCGTGGAC GCCGACCCTG 
CTTGCTCCGT TGGCAAGGGTGCGCTGTACG ACGGTAGGAC 
GCTGTTCCAT GGGCCGGCGT TTCAGTACATGGATGAGGTT C 

SEQ ID NO:8: 

CGCAAGTGCATCCGGCCATCATTGGGCCATCATTGGGCCATCATTGGT 

GTTTTGGGCCGCGCTTTGCGGATCGTCCGGCCGATCAGGTACGAGGCC 

ACGAACCTACGTCGTTTGCCGCGCTCAGGCTGGTTGGTTGCACTTGGA 

CTCTTCTGTGACCTTTCATCGTGTGCAGGCAAACTCGATTTGCAGACCC 

GAGACACGGCGAAGGATCCGTGCTGCAAACGCAAGTGGAGTGCGTCG 

AGAGCACCGCCGAGACCAAGAGCCGAGGCAGACAAGGCCAGCAACG 

AGATGGAGACAAAGGACGATCGCGTTGCGATCGTGGGCATGTCGGCC 

ATACTGCCTTGCGGTGAGTCAGTGCGCGAGTCGTGGGAGGCGATTCGC 

GAGGGGCTCGATTGCCTGCAGGACCTGCCTGCGGACCGAGTCGATAT 

CACGGCGTACTACGACCCGAACAAGACAACCAAGGACAAGATCTACT 

GCAAGCGCGGCGGCTTCATTCCCGAGTATGACTTTGACGCGCGCGAGT 

TCGGCCTCAACATGTTCCAGATGGAGGACTCGGACGCCAACCAAACC 

GTGACTTTGCTCAAGGTCAAGGAGGCTCTCGAGGACGCCGGGGTGGA 

GCCCTTCACAAAGAAGAAGAAGAACATTGGCTGCGTGCTCGGCATCG 

GCGGCGGGCAGAAGGCGAGCCACGAGTTTTACTCCCGACTCAACTAT 

GTGGTCGTGGAGAAGGTGCTTCGCAAGATGAACCTCCCCGACGAGGT 

TGTCGAGGCCGCCGTCGAAAAGTACAAGGCCAACTTTCCTGAATGGC 

GCCTCGACTCGTTCCCTGGGTTTCTTGGCAACGTGACCGCCGGGCGGT 

GCAGCAACGTCTTCAACATGGAAGGCATGAACTGCGTCGTGGACGCT 

GCGTGCGCCAGCTCGCTCATCGCGATCAAGGTTGCCATTGATGAGCTC 



CTCCACGGGGACTGCGACACCATGATTGCCGGTGCGACCTGCACCGA 

CAACTCGATCGGGATGTACATGGCCTTTTCCAAAACCCCAGTTTTCTCC 

ACCGACCAGAGCGTCAAGGCGTACGACGCCAAGACGAAAGGCATGC 

TCATCGGCGAAGGCTCGGCCATGGTCGTGCTCAAGCGGTACGCGGAC 

GCCGTTCGGGATGGTGATGAGATCCATGCCGTCATCAGGGCATGCGCC 

TCGTCCAGCGACGGCAAGGCTGCTGGCATTTACGCACCGACGGTGTCG 

GGTCAAGAAGAGGCACTGCGGCGCGCGTACGCCCGAGCTGGCGTGGA 

CCCCTCCACCGTCACGCTGGTGGAGGGCCACGGCACTGGCACACCCG 

TCGGGGACCGGATTGAGCTGACCGCCTTGCGCAACGTCTTTGACGCAG 

CCAACAAAGGCCGCAAGGAAACAGTCGCGGTGGGAAGCATCAAGTC 

GCAGATCGGTCACCTGAAGGCCGTGGCCGGCTTTGCCGGTCTCGTCAA 

GGTTGTCATGGCCCTCAAGCACAAGACGCTGCCGCAGACCATCAACG 

TTCACGACCCGCCCGCACTGCACGACGGCTCGCCCATCCAGGATTCGA 

GTCTTTACATCAACACGATGAACCGGCCCTGGTTTACGGCACCTGGCG 

TCCCCCGCCGTGCAGGCATCTCrAGCTTTGGGTTTGGCGGCGCCAACT 

ACCACGCTGTTCTCGAAGAGGCCGAGCCTGAGCACGCGAAGCCGTAT 

CGCATGAACCAAGTTCCACAACCGGTGCTCTTGCACGCAAGCTCCGCG 

TCAGCTCTTGCCTCCATCTGCGACGCTCAGGCCGACGCGCTCCAGGCC 

GCCGTCTCGCCCGAAGCCAGCAAGCACGCAGACTACCGCGCCATCGT 

AGCGTTCCATGAAGCGTTTAAGCTTCGCGCTGGAGTGCCGGCCGGCCA 

TGCTCGAATTGGCTTTGTGTCCGGCAGCGCGGCAGCAACGCTTGCAGT 

GCTCCGAGCCGCCTCTGCAAAACTCAAGCAGTCGAGTGCGACGCTCG 

AATGGACCCTGCTCCGCGAGGGCGTCACGTACCGCTCCGCCGCGATG 

CACACTCCTGGCAGTGTCGCTGCTCTGTTTGCCGGGCAAGGCGCGCAG 

TACACGCACATGTTCGCTGACGTTGCCATGAACTGGCCACCGTTTCGA 

AGCGCCGTGCAAGAGATGGATGCCGCTCAAGTCACGGCGGCAGCGCC 

GAAGCGCCTCAGCGAGGTCCTGTATCCGCGCAAGCCGTACGCTGCAG 

AGCCCGAGCAAGACAACAAGGCCATCTCGATGACGATTAACTCGCAA 

CCGGCCCTCATGGCCTGCGCTGCTGGGGCGTTTGAGGTG'nTCGTCAA 



GCTGGTCTTGCGCCCGACCACGTCGCGGGTCATTCTCTCGGCGAGTTT 

GGTGCTTTGCTCGCCGCTGGATGCGCAAGCCGTGAGGAGCTCTTCCGT 

CTGGTCTGCAGCAGAGCGAAGGCAATGCAAGACGTTCCCAAGCCAAG 

CGAGGGCGTCATGGCAGCTGTCATCGGCCGTGGTGCTGACAAGCTCA 

CGCTGCAAGGCGATGGTGCGTGGCTTGCCAACTGCAACTCGCCAAGC 

CAAGTGGTCATTTCCGGCGACAAGACTGCTGTCGAGCGTGAATCCAGC 

CGGTTGGCAGGCCTTGGCTTCAGGATCATTCCGCTTGCATGCGAAGGC 

GCCTTCCATTCACCGCACATGACGGCGGCCCAGGCCACGTTTCAGGCT 

GCACTGGACAGCCTCAAGATCTCCACCCCGACGAACGGGGCGCGCCT 

GTACAACAACGTTTCCGGAAAGACCTGCCGATCCCTGGGTGAACTCC 

GCGACTGCCTGGGCAAGCACATGACAAGTCCTGTGCTCTTCCAGGCAC 

AGGTAGAGAACATGTACGCTGCCGGGGCGCGCATTTTCGTGGAGTTTG 

GCCCGAAGCAAGTCCTCTCCAAGCTCGTAGGCGAGATTCTCGCCGAC 

AAGTCAGACTTTGTGACAGTCGCGGTCAACTCGTCATCGTCCAAGGAC 

AGCGACGTGCAACTTCGTGAAGCTGCTGCGAAGCTCGCGGTCCTTGGC 

GTCCCGTTGGCGAACTTTGACCCTTGGGAGCTCTGCGACGCGCGGCGT 

CTTCGCGAATGCCCGCGATCCAAGACGACGTTGCGCTTGTCTGCAGCG 

ACCTACGTGTCGAACAAGACCCTTGCTGCTAGGGAGAAGGTCATGGA 

GGACAACTGCGACTTTTCTTCGCTCTTTGCCTCCGGTCCAGCAAGCCA 

AGAGATGGAGCGAGAAATAGCCAACCTTCGCGCTGAGCTGGAGGCGG 

CCCAACGCCAGCTTGACACGGCCAAAACCCAGCTTGCTCGAAAGCAA 

GTGCAGGACCCCACCGCTGACCGACAGCGCGATATGATTGCCAAGCA 

CCGATCCACACTCGCAGCAATGGTGAAGGAATTCGAGGCTCTGGCAA 

GTGGTAGTCCTTGCGCTGTTCCGTTTGCGCCTGTGGTGGACACTGCTGT 

CGAAGACGTGCCTTTTGCGGACAAGGTCTCGACGCCACCGCCCCAAG 

TCACTTCCGCTCCCATCGCCGAGCTCGCGCGCGCCGAGGCCGTCGTCA 

TGGAGGTTCTCGCTGCCAAGACTGGCTACGAGGTCGACATGATCGAG 

GCCGACATGCTGCTCGACGCCGAGCTCGGCATCGACTCGGTCAAGCG 

CATTGAGATCCTGGCAGCTGTCCAGGCCCAGCTCGGGGTCGAGGCCA 



AGGACGTCGACGCGCTCAGCCGCACACGAACAGTTGGCGAGGTCGTT 

GACGCCATGAAGGCTGAGATCGGCGGGCAAGCGACCAGTGCGCCTTC 

GCCGATGGCCCAGCCCCAAGCCTCAGCACCATCACCGTCCCCTACTGC 

CTCTGTGCTGCCTAAGCCTGTTGCTTTACCAGCTAGTGTCGATCCCGCC 

AAGCTCGCGCGCGCCGAAGCGGTCGTCATGGAGGTTCTCGCCGCCAA 

GACTGGCTACGAGGTCGACATGATCGAGGCTGACATGCTGCTCGACG 

CCGAGCTCGGCATCGACTCGGTCAAGCGCATTGAGATCCTGGCGGCTG 

TCCAAGCTCAGCTCGGGGTCGAGGCCAAGGATGTCGACGCGCTCAGC 

CGCACACGCACTGTTGGCGAGGTCGTTGATGCCATGAAGGCTGAGAT 

CGGCGGGCAAGCGACCAGCGCACCTGCGTCCGTGGCCCAGCCCCAAG 

CCTCAGCACCATCACCGTCCGCAACAACTGCCTCTGTGCTGCCTAAGC 

CTGTTGCTGCACCAACTAGCGCCGATCCCGCCAAGCTCGCGCGCGCCG 

AAGCCGTCGTCATGGAGGTTCTCGCTGCCAAGACTGGCTACGAGGTCG 

ACATGATCGAGGCTGACATGCTGCTCGACGCCGAGCTCGGCATCGACT 

CGGTCAAGCGCATTGAGATCCTGGCGGCTGTCCAAGCCCAGCTCGGG 

GTCGAGGCCAAGGACGTCGACGCGCTCAGCCGCACACGCACGGTTGG 

CGAGGTCGTCGAGGCCATGAAGGCTGAGATCGGCGGGCAAGCGACC 

AGTGCACCTGCGTCCGTGGCCCAGCCCCAAATCTCTGTGTCCCCTACG 

CCTCTCGCTGCATCTCCTAGTGCCGATCCTGCCAAGCTCGCGCGCGCC 

GAAGCCGTCGTCATGGAGGTTCTCGCTGCCAAGACTGGCTACGAGGTC 

GACATGATCGAGGCTGACATGCTGCTCGACGCCGAGCTCGGCATCGA 

CTCCGTCAAGCGCATCGAGATCCTGGCGGCTGTCCAGGCCCAGCTCGG 

GGTCGAGGCCAAGGACGTCGACGCGCTCAGCCGCACACGCACTGTTG 

GCGAGGTCGTTGACGCCATGAAGGCTGAGATCGGCGGGCAAGCGACC 

AGTGCGCCTGCATCCGTGGCCCAGCCCCAAGCCTCAGCACCGTCGCC 

GTCCGCTACTGCCTCTGTGCTGCCTAAGCCTGTTGCTGCACCAACTAGC 

GCCGATCCCGCCAAGCTCGCGCGCGCCGAAGCCGTCGTCATGGAGGT 

TCTCGCTGCCAAGACTGGCTACGAGGTCGACATGATCGAGGCTGACAT 

GCTGCTCGACGCCGAGCTCGGCATCGACTCGGTCAAGCGCATCGAGA 



TCCTGGCGGCTGTCCAAGCCCAGCTCGGGGTCGAGGCCAAGGACGTC 

GACGCGCTCAGCCGCACACGCACGGTTGGCGAGGTCGTCGAGGCCAT 

GAAGGCTGAGATCGGCGGGCAAGCGACCAGTGCACCTGCGTCCATGG 

CCCAGCCCCAAATCTCTGTGTCCCCTACGCCTCTCGCTGCATCTCCTAG 

TGCCGATCCTGCCAAGCTCGCGCGCGCCGAGGCCGTCGTCATGGAGGT 

TCTCGCTGCCAAGACTGGCTACGAGGTCGACATGATCGAGGCCGACA 

TGCTGCTCGACGCCGAGCTCGGCATCGACTCGGTCAAGCGCATCGAG 

ATCCTGGCGGCTGTCCAAGCTCAGCTCGGGGTCGAGGCCAAGGACGT 

CGACGCGCTCAGCCGCACACGCACGGTTGGCGAGGTCGTTGATGCCA 

TGAAGGCTGAGATCGGCGGGCAAGCGACCAGTGCGCCTGCATCCGTG 

GCCCAGCCCCAAGCCTCAGCACCGTCGCCGTCCGCTACTGCCTCTGCG 

CCTGTTACGCCTCTCGCTGCACCAGCTAGTGTCGATCCCGCCAAGCTC 

GCGCGCGCCGAAGCCGTCGTCATGGAGGTTCTCGCCGCCAAGACTGG 

CTACGAGGTCGACATGATCGAGGCTGACATGCTGCTCGACGCCGAGC 

TCGGCATCGACTCCGTCAAGCGGATTGAGATCCTGGCGGCTGTCCAAG 

CCCAGCTCGGGGTCGAGGCCAAGGACGTCGACGCGCTCAGCCGCACA 

CGCACTGTTGGCGAGGTCGTTGACGCCATGAAGGCTGAGATCGGCGG 

GCAAGCGACCAGCGCACCTGCGTCCGTGGCCCAGCCCCAAGCCTCAG 

CACCGTCGCCGTCCGCTACTGCCTCTGTGCTGCCTAAGCCTGTTGCTTC 

ACCAGCTAGTGTCGATCCCGCCAAGCTCGCGCGCGCCGAAGCGGTCG 

TCATGGAGGTTCTCGCTGCCAAGACTGGCTACGAGGTCGACATGATCG 

ACGCTGACATGCTGCTCGACGCCGAGCTCGGCATCGACTCCGTCAAGC 

GCATCGAGATCCTGGCGGCTGTCCAAGCCCAGCTCGGGGTCGAGGCC 

AAGGACGTCGACGCGCTCAGCCGCACACGAACGGTTGGCGAGGTCGT 

CGAGGCCATGAAGGCTGAGATCGGGGCAGCAGGTCCAAACGATGCA 

CAAGCAGCGTCTGGGCATCTCTTTGGCACGGGATGTGAAGACCTGAG 

CCTTTGCTCTGCTrCTGTGGTTGAGATTGCTCGTTGCAGCGAACTAGCT 

CTGGAGCGCCCGATGGATCGGCCCATTCTTATTGTAAGCGATGGATCA 

GCATTGCCGGCGGCTCTGGCTAGTCGACTGGGGTCGTGTGCAGTAATC 



CTCACGACCGCAGGCGAGACCGACCAATCTGTGCGCTCGACGAAGCA 

CGTTGACATGGAAGGGTGGGGCGAGGCAGATCTCGTGCGCGCTCTTG 

AAGCAGTAGAGTCTCGATTCGGCGTCCCAGGCGGCGTCGTGGTGCTTG 

AGCGCGCCTCAGAAACAGCTAGGGACCAGCTTGGCTTTGCCCTGCTGC 

TTGCCAAGCATTCGAGCAAAGCGCTCAACCAGCAGATCCCAGGCGGG 

CGCGCCTGCTTCGTGGGCGTCTCGCGAATCGACGGAAAGCTCGGACTT 

AGCGGAGCTTGCGCGAAAGGAAAGGGCTGGGCTGAGGCCGCAGAGA 

TTGCTCAGCAAGGAGCCGTCGCGGGCTTGTGCAAGACCTTGGACCTAG 

AGTGGCCGCACGTCTTCGCTCGCAGCATCGACATCGAGCTTGGCGCGA 

ACGAAGAAACAGCTGCGCAAGCAATCTTTGAGGAGCTCTCTTGCCCG 

GACCTAACGGTGCGCGAAGCAGGATACACCAAAGACGGCAAGCGGT 

GGACGACTGAGGCGCGACCGGTTGGGCTTGGCAAGCCCAAGCAGGCA 

CTACGTTCTTCGGACGTCTTCTTGGTTTCTGGTGGGGCGCGGGGAATTA 

CACCTGTTTGCGTTCGCGAGTTGGCCAAATCGATCAGTGGTGGCACTTT 

TGTCCTCCTCGGGCGGTCCCCTCTCGCTGATGATCCGGCGTGGGCTTGC 

GGCGTCGAGGAAGCAAACATTGGGACAGCCGCTATGGCGCACCTCAA 

GGCCGAGTTCGCAGCCGGGCGCGGCCCGAAGCCGACGCCAAAGGCC 

CACAAAGCACTCGTTGGGAGCGTCCTGGGGGCGCGCGAAGTCCTTGG 

TTCGCTAGAGAGTATTCGCGCCCAGGGTGCGCGCGCCGAGTACGTTTC 

CTGCGACGTTTCGTGTGCGGAGCGCGTCAAGGCCGTCGTCGACGATCT 

CGAGCGACGGGTCGGGGCTGTAACTGGGGTTGTGCACGCCTCTGGTGT 

TCTCCGAGACAAGTCCGTTGAGCGCTTGGAGCTCGCCGACTTCGAGGT 

CGTGTACGGCACCAAGGTGGACGGCCTGCTCAACCTGCTGCAGGCCG 

TGGACCGCCCCAAACTCCGGCACTTGGTCCTCTTCAGCTCCCTGGCCG 

GTTTCCACGGCAACACTGGGCAGGCCGTGTACGCTATGGCGAATGAG 

GCGCTGAACAAGATGGCCTTCCATTTGGAAACTGCGATGCCTGGCCTC 

TCGGTCAAGACGATCGGGTTTGGACCTTGGGACGGCGGCATGGTCAA 

CGATGCGCTGAAAGCGCACTTTGCGTCTATGGGCGTCCAAATTATTCC 

GCTCGACGGCGGCGCGGAGACCGTTTCCCGAATCATCGGGGCGTGCT 



CGCCAACACAAGTTCTGGTTGGCAACTGGGGCTTGCCCCCTGTAGTTC 

CTAACGCGAGCGTGCACAAGATTACTGTGAGGCTTGGCGGGGAGTCT 

GCAAACCCTTTCCTGTCCTCGCACACGATTCAAGGCAGAAAGGTCTTG 

CCGATGACTGTGGCGCTTGGGCTTCTCGCTGAGGCGGCTCGAGGGCTC 

TACGTCGGTCACCAAGTAGTCGGGATTGAGGACGCCCAAGTCTTCCAG 

GGAGTCGTGTTGGACAAAGGGGCGACGTGTGAGGTCCAGCTTCGCCG 

CGAGTCTTCGACTGCAAGCCCAAGCGAGGTTGTGCTGAGTGCTTCGCT 

CAATGTATTCGCGGCGGGAAAGGTTGTGCCTGCGTACCGCGCGCATGT 

CGTGCTCGGCGCTTCAGGGCCACGCACTGGCGGCGTGCAGCTTGAACT 

GAAAGATTTGGGCGTGGACGCCGACCCTGCTTGCTCCGTTGGCAAGGG 

TGCGCTGTACGACGGTAGGACGCTGTTCCATGGGCCGGCGTTTCAGTA 

CATGGATGAGGTTCTTCGGTGCTCGCCTGCAGAGCTTGCCGTGCGGTG 

CCGTGTCGTTCCGAGCGCGGCTCAGGACCGCGGCCAATTTGTTTCGCG 

CGGAGTGTTGTACGACCCGTTCCTGAACGACACGGTGTTTCAAGCTCT 

CCTTGTTTGGGCCCGTCTGGTCAGGGACAGCGCTTCGCTACCGAGCAA 

CGTTGAACGAATCTCGTTCCACGGCCAGCCGCCGAGCGAGGGCGAGG 

TGTTTTACACCACGCTCAAGCTGGACAGTGCTGCGAGCGGGCCGCTCG 

ACCCGATTGCAAAGGCGCAGTTCTTCCTCCACCGAGCTTGCGGGGCGG 

TCTTTGCATCAGGGCGAGCGAGTGTGGTTCTGAACAAGGCTCTTTCGTT 

TTGA 

SEQ ID NO:9: 

CAAGCAATCGGCCATCGAGCTGCGCGTTGGAGCTGCCGATCGAAATC 

GAAAGCAAGAGGCCACAAGGCTCAGAAAGAGATGAACCAGGGCGGG 

AGAAATGACGAGGGCGTCTCGGTGGCGCGCGCGGACCCATGCCCTGA 

CACGCGGATCGCTGTCGTGGGCATGGCGGTCGAGTATGCAGGGTGCC 

GCGGCAAGGAAGCGTTCTGGGACACGCTCATGAACGGCAAAATCAAC 

TCTGCCTGTATCTCAGACGATCGCCTCGGGTCAGCACGACGAGAAGA 

GCACTATGCGCCCGAGAGGTCAAAGTACGCCGATACGTTCTGCAACG 



AGAGGTACGGATGCATCGATCCCAAAGTCGACAACGAGCACGACCTG 

CTCCTCGGCCTCGCCGCGGCTGCGCTTCAAGACGCGCAGGACAGGCG 

CAGCGACGGCGGCAAGTTCGACCCAGCGCAGCTCAAGCGCTGCGGCA 

TTGTCAGCGGCTGCCTGTCCTTCCCGATGGACAACCTGCAAGGCGAGC 

TGCTCAACCTTTACCAAGCCCATGCTGAGAGGCGGATTGGCAAGCATT 

GCTTCGCGGACCAAACGCCCTGGTCGACGCGAACCAGAGCGCTTCAC 

CCGCTGCCCGGGGACCCGAGGACCCACCGCGACCCAGCCTCCTTCGT 

CGCCGGACAGCTCGGCCTCGGCCCGCTGCACTACTCGCTCGACGCCGC 

CTGCGCCTCGGCCCTTTACGTTCTGCGACTCGCTCAGGACCACCTCCTC 

TCGGGCGAGGCTGACTTGATGCTGTGCGGAGCGACGTGCTTCCCAGAG 

CCCTTCTTCATCCTGACTGGGTTTAGCACGTTCCACGCGATGCCAGTCG 

GTGAGAACGGTGTCTCGATGCCGTTTCATCGGGACACGCAAGGGCTG 

ACGCCCGGCGAGGGCGGCTCGGTGATGGTGCTCAAGCGCCTCGCGGA 

CGCCGAGCGCGACGGAGACCACATCTACGGGACGCTTCTTGGAGCCA 

GCTTGAGCAACGCAGGCTGCGGGCTTCCTCTCAAGCCGCACCAGCCA 

AGCGAGGAGGCCTGCTTGAAAGCCACCTACGAGCTCGTCGGCGTGCC 

GCCCCGAGACGTCCAGTACGTCGAGTGCCACGCCACCGGCACGCCGC 

AGGGCGACACCGTCGAGCTCCAAGCCGTCAAAGCCTGCTTTGAGGGC 

GCAAGCCCCCGGATCGGGTCCACGAAAGGCAACTTCGGACACACCCT 

CGTCGCGGCCGGCTTTGCGGGAATGTGCAAGGTTCTCCTTGCAATGGA 

GCGCGGCGTGATCCCCCCGACCCCGGGCGTTGACTCTGGCACCCAGAT 

TGATCCCCTCGTCGTCACAGCGGCGCTCCCGTGGCCGGATACGCGCGG 

CGGGCCGAAACGCGCAGGACTCTCCGCATTCGGATTCGGGGGCACAA 

ACGCGCACGCCGTCTTTGAGGAGCATATTCCCTCGAGAGCTCCGCCCG 

CAGTACTCTGCCAGCCTCGCCTCGGCAGCGGACCAAACCGAAAGCTT 

GCTATCGTCGGCATGGATGCCACGTTTGGATCCTTGAAGGGTCTCTCC 

GCACTAGAAGCTGCGCTTTACGAGGCAAGGCACGCTGCGCGGCCCCT 

GCCTGCGAAGCGCTGGCGCTTCTTGGGCGGGGACGAGTCCTTTCTCCA 

CGAGATCGGACTCGAGTGCTCTCCGCACGGGTGCTACATTGAGGACGT 



GGATGTGGACTTTAAGCGACTCCGCACGCCAATGGTGCCGGAGGACT 

TGCTCCGGCCGCAACAGCTCCTGGCCGTGTCGACGATTGACAAGGCC 

ATCCTCGACTCGGGCTTGGCCAAGGGCGGCAACGTGGCTGTCCTTGTC 

GGCCTCGGGACGGACCTCGAGCTCTACCGCCACCGAGCTCGGGTTGC 

GCTTAAGGAGCGTCTTCAAGGACTGGTTCGCTCTGCCGAGGGAGGAG 

CCCTGACGTCTCGCCTGATGAACTATATCAATGATAGCGGAACGTCGA 

CCTCCTACACGTCGTATATCGGCAACCTCGTCGCCACGCGCGTCTCGT 

GCCAGTGGGGCTTCACTGGGCCGTCGTTCACCGTCACGGAAGGGGCC 

AACTCGGTCCATCGGTGCGCCCAGCTCGCCAAGTACATGCTCGACCGC 

GGCGAGGTCGACGCCGTCGTGGTTGCAGGAGTCGACCTGTGCGGGAG 

CGCCGAGGCGTTCTTCGTGAGGTCGCGCCGCATGCAGATCTCGAAAA 

GTCAGCGCCCGGCCGCGCCGTTTGACCGCGCCGCAGACGGCTTCTTCG 

CGGGGGAAGGGTGCGGCGCCCTCGTCTTCAAACGCCTGACTGACTGT 

GTGTCTGGCGAGCGAATCTACGCGTCCCTCGACTCGGTCGTCGTCGCA 

ACCACGCCGCGCGCCGCTCTTCGTGCTGCCGCAGGGTCGGCGCGGGTT 

GACCCAGCCAGCATCGACATGGTCGAGCTGAGCGCAGATTCCCACCG 

GTTTGTGCGGGCGCCAGGCACCGTGGCTCAGCCTCTGACAGCCGAAGT 

CGAGGTCGGGGCGGTGCGGGAAGTGATCGGGACCGCGGGGAGGGGC 

TCTCGAAGCGTGGCCGTCGGATCGGTCCGCGCCAACGTCGGGGACGC 

AGGGTTTGCTTCCGGGGCCGCTGCCCTCGTAAAAACTGCGCTCTGCTT 

GCACAACCGCTACTTGGCGGCTACCCCAGGCTGGGATGCGCCTGCTGC 

CGGCGTGGATTTTGGTGCCGAGCTGTACGTTTGCCGCGAGTCGCGTGC 

TTGGGTCAAGAACGCCGGCGTTGCACGGCACGCCGCAATTTCTGGCGT 

GGACGAAGGCGGGTCGTGCTATGGGCTGGTTCTTTCGGACGTGCCTGG 

GCAGTACGAGACCGGCAACCGCATCTCCCTCCAGGCCGAGTCGCCCA 

AGCTCTTGCTCCTCTCGGCTCCAGACCACGCCGCCTTGCTGGACAAGG 

TGGCGGCCGAGCTCGCAGCCCTTGAGCAAGCCGACGGCTTGAGCGCC 

GCCGCGGCTGCCGTAGACCGCTTACTCGGCGAGTCGCTCGTCGGTTGC 

GCGGCTGGCAGCGGCGGGCTGACCCTTTGCTTGGTGGCTTCGCCTGCC 



AGCCTCCACAAGGAGCTTGCGCTGGCCCATCGAGGGATCCCGCGCTG 

CATCAAAGCACGGCGCGACTGGGCCAGCCCGGCAGGGAGCTACTTCG 

CCCCGGAGCCGATCGCAAGCGACCGCGTCGCGTTCATGTACGGGGAA 

GGACGAAGCCCGTACTGCGGCGTCGGCCGCGACCTCCACCGGATCTG 

GCCCGCGCTGCATGAGCGGGTGAACGCCAAGACTGTCAACCTCTGGG 

GTGACGGTGACGCCTGGCTGCTGCCACGTGCAACCTCGGCCGAGGAA 

GAGGAGCAACTCTGCCGCAACTTCGACTCGAACCAGGTTGAGATGTTT 

CGAACGGGCGTGTACATCTCGATGTGCTTGACCGACCTCGCTCGAAGC 

TTGATTGGACTGGGCCCTAAGGCGAGCTTTGGGCTCAGCCTAGGCGAG 

GTTTCCATGCTCTTCGCTCTGAGCGAGTCCAACTGTAGACTGTCGGAG 

GAAATGACCCGCAGGCTCCGTGCGTCCCCGGTGTGGAACTCGGAGCT 

CGCCGTCGAGTTCAACGCCCTTCGAAAGTTGTGGGGGGTCGCGCCGGG 

GGCACCCGTCGACTCGTTCTGGCAAGGTTATGTCGTGCGCGCAACGCG 

GGCTCAGGTGGAGCAAGCCATTGGGGAGGACAATCAGTTTGTGCGTC 

TCCTGATCGTGAACGACTCGCAATCAGTCCTGATCGCCGGCAAGCCGG 

CGGCGTGCGAAGCCGTAATTGCTCGCATCGGGTCTATTCTTCCCCCGCT 

GCAAGTGTCGCAAGGCATGGTGGGGCACTGTGCCGAGGTCTTGCCGT 

ACACGAGCGAGATCGGGCGCATCCACAACATGCTTCGCTTCCCATCGC 

AGGACGAAACGGGCGGTTGCAAAATGTACTCTAGCGTCTCAAACTCG 

CGCATCGGGCCAGTCGAGGAGAGCCAGATGGGCCCAGGCACTGAGCT 

CGTTTTCTCGCCGTCAATGGAAGACTTTGTCGCCCAGCTGTACTCGCGA 

GTTGCAGACTTTCCGGCGATCACCGAGGCGGTTTACCAGCAGGGTCAT 

GACGTGTTTGTCGAAGTGGGGCCGGACCATTCACGGTCGGCTGCTGTC 

CGCTCCACGCTTGGACCCACTCGGCGACACATCGCTGTGGCGATGGAC 

CGCAAGGGTGAGTCAGCTTGGTCGCAGCTTCTGAAAATGCTGGCTACG 

CTTGCGTCGCACCGCGTGCCGGGCCTGGACCTTTCATCCATGTACCAC 

CCCGCAGTGGTGGAGCGTTGCAGGCTGGCGCTGGCAGCACAACGATC 

GGGCCAGCCAGAGCAGCGGAACAAGTTTTTGCGCACGATAGAGGTGA 

ATGGGTTCTACGACCCGGCCGACGCGACCATCCCTGAGGCCGTCGCA 



ACA.ATTCTGCCGGCAACTGCTGCGATTTCGCCTCCAAAGCTTGGCGCT 

CCGCACGACTCGCAACCCGAGGCGGAGGCTCGCCCCGTGGGCGAGGC 

CTCTGTGCCAAGGCGGGCCACGAGCTCGAGCAAATTGGCCAGGACGC 

TTGCCATCGATGCTTGCGACTCCGACGTGCGCGCCGCCTTGCTGGACC 

TGGACGCGCCAATCGCGGTCGGCGGCTCCTCGCGCGCCCAAGTCCCG 

CCGTGCCCAGTGAGCGCGCTCGGAAGCGCCGCCTTTCGAGCGGCACA 

CGGCGTCGATTATGCGCTCTACATGGGCGCAATGGCCAAAGGCGTCG 

CGTCAGCGGAGATGGTCATCGCTGCTGGCAAGGCCCGCATGCTCGCGT 

CATTTGGCGCGGGGGGGCTTCCCCTGGGCGAGGTCGAAGAGGCGTTG 

GACAAGATCCAGGCCGCTCTGCCCGAGGGGCCGTTCGCCGTCAACCT 

CATTCACTCGCCGTTCGATCCAAACCTTGAGGAGGGCAACGTCGAGCT 

GTTCCTGAGGCGCGGTATCCGGCTGGTCGAGGCCTCTGCGTTCATGTC 

GGTCACGCCGTCGTTGGTGCGCTACCGAGTCGCCGGACTCGAGCGAG 

GCCCTGGCGGGACCGCCCGAGTGCTGAACCGCGTGATTGGCAAGGTG 

AGCCGTGCGGAGCTCGCAGAAATGTTTATGCGGCCGCCTCCCGCCGCG 

ATCGTCTCCAAGCTCCTCGCCCAGGGCCTGGTCACTGAGGAGCAGGC 

GTCACTTGCAGAGATCGTCCCACTGGTTGACGACGTTGCAATCGAAGC 

CGACTCGGGCGGTCACACAGACAACCGCCCGATCCACGTCGnTTGCC 

CGTCGTCCTCGCGCTGCGAGACCGCGTCATGCGTGAGTGCAAGTATCC 

AGCCGCCAATCGCGTCCGCGTGGGCGCCGGAGGCGGGATCGGCTGCC 

CTGCCGCGGCGCGAGCTGCGTTCGACATGGGCGCAGCATTCGTTCTCA 

CGGGCTCGATCAACCAGCTCACGCGCCAGGCTGGGACGAGCGACAGC 

GTGCGTGCTGCCCTTGCACGCGCGACCTACTCGGACGTGACAATGGCC 

CCGGCGGCCGATATGTTTGACCAGGGCGTCAAGCTGCAGGTCTTGAAG 

CGCGGCACGATGTTCCCGGCGCGCGCAAACAAGCTGTACGAGTTGTTC 

ACCACTTACCAGTCGCTGGACGCGATCCCTCGGGCTGAGCTGGCTCGC 

CrGGAAAAGCGAGTTTTCCGCATGTCCATCGACGAGGTTTGGAACGA 

AACCAAGCAGTTCTACGAGACCCGGCTCAACAACCCCGCCAAGGTTG 

CCCGGGCGGAGCGCGACCCCAAGCTCAAGATGTCGCTCTGCTTTCGGT 



GGTACTTGTCGAAAAGCTCCAAGTGGGCATCGACTGGACAAGTTGGG 

CGCGAGCTGGACTACCAGGTCTGGTGCGGCCCCACGATTGGCGCTTTC 

AACGAGTTCGTGAAGGGGTCCAGCCTCGACGCGGAGGCTTGCGGGGG 

GCGGTTTCCTTGCGTTGTGCGCGTTAACCAGGAGATATTATGTGGCGCT 

GCTTACGAGCAGCGACTGGCGCGTTTCATGCTGCTCGCTGGCCGGGAA 

AGCGCGGACGCGTTGGCGTACACGGTTGCGGAAGCCAGATAG 

SEQ ID NO:10: 

RKCIRPSLGHHWAIIGVLGRALRTVRPIRYEATNLRRLPRSGWLVALGLFCD 

LSSCAGKLDLQTRDTAKDPCCKRKWSASRAPPRPRAEADKASNEMETKD 

DRVArVGMSAILPCGESVRESWEAIREGLDCLQDLPADRVDITAYYDPNKT 

TKDKTVCKRGGFIPEYDFDAREFGLNMFQMEDSDANQTVTLLKVKEALED 

AGVEPFTKKKKNIGCVLGIGGGQKASHEF^SRL>mA^VEKVLRKMNLPDE 

WEAAVEKYKANFPEWRLDSFPGFLGNVTAGRCSNVFNMEGMNCWDA 

AGASSUAIKVAIDELLHGDCDTMIAGATCTDNSIGMYMAFSKTPVFSTDQ 

SVKAYDAKTKGMLIGEGSAMVVLKRYADAVRDGDEIHAVIRACASSSDGK 

AAGr^APTVSGQEEALRRAYARAGVDPSTVTLVEGHGTGTPVGDPJELTAL 

R>A^DAANKGRKEWAVGSIKSQIGHLKAVAGFAGLVKVVMALKFiKTLP 

QTINVFfDPPALPTOGSPIQDSSLYINTMNRPWFTAPGVPRRAGISSFGFGGA 

NYHAVLEEAEPEHAKPYRMNQVPQPVLLHASSASALASICDAQADALQA 

AVSPEASKHADYRAIVAFHEAFKLRAGVPAGHARIGFVSGSAAATLAVLR 

AASAKLKQSSATLEWTLLREGVTYRSAAMHTPGSVAALFAGQGAQYTHM 

FADVAMNWPPFRSAVQEMDAAQVTAAAPKRLSEVLYPRKPYAAEPEQD 

NKAISMTINSQPALMACAAGAFEVFRQAGLAPDHVAGHSLGEFGALLAA 

GCASREELFRLVCSRAKAMQDVPKPSEGVMAAVIGRGADKLTLQGDGAW 

LANCNSPSQWISGDKTAVERESSRLAGLGFRIIPLACEGAFHSPHMTAAQ 

ATFQAALDSLKISTPTNGARLYNNVSGKTCRSLGELRDCLGKHMTSPVLFQ 

AQVENMYAAGARIFVEFGPKQVLSKLVGEILADKSDFVTVAVNSSSSKDSD 

VQLREAAAKLAVLGVPLANFDPWELCDARRLRECPRSKTTLRLSAATYVS 



NKTLAAREKVMEDNCDFSSLFASGPASQEMEREIANLRAELEAAQRQLDT 

AKTQLARKQVQDPTADRQRDMIAKHRSTLAAMVKEFEALASGSPCAVPF 

APVVDTAVEDWFADKVSTPPPQVTSAPIAELARAEAVVMEVLAAKTGYE 

VDMIEADMLLDAELGIDSVKRIEILAAVQAQLGVEAKDVDALSRTRTVGE 

WDAMKAEIGGQATSAPSPMAQPQASAPSPSPTASVLPKPVALPASVDPA 

KLARAEAWMEVLAAKTGYEVDMIEADMLLDAELGIDSVKRIEILAAVQA 

QLGVEAKDVDALSRTRTVGEWDAMKAEIGGQATSAPASVAQPQASAPS 

PSATTASVLPKPVAAPTSADPAKLARAEAWMEVLAAKTGYEVDMIEAD 

MLLDAELGIDSVKRIEILAAVQAQLGVEAKDVDALSRTRTVGEWEAMKA 

EIGGQATSAPASVAQPQISVSPTPLAASPSADPAKLARAEAWMEVLAAKT 

GYEVDMIEADMLLDAELGIDSVKRIEILAAVQAQLGVEAKDVDALSRTRT 

VGEWDAMKAEIGGQATSAPASVAQPQASAPSPSATASVLPKPVAAPTSA 

DPAKLARAEAVVMEVLAAKTGYEVDMIEADMLLDAELGIDSVKRIEILAA 

VQAQLGVEAKDVDALSRTRTVGEWEAMKAEIGGQATSAPASMAQPQIS 

VSPTPLAASPSADPAKLARAEAWMEVLAAKTGYEVDMIEADMLLDAEL 

GIDSVKRIEILAAVQAQLGVEAKDVDALSRTRTVGEWDAMKAEIGGQAT 

SAPASVAQPQASAPSPSATASAPVTPLAAPASVDPAKLARAEAWMEVLA 

AKTGYEVDMIEADMLLDAELGIDSVKRIEILAAVQAQLGVEAKDVDALSR 

TRTVGEWDAMKAEIGGQATSAPASVAQPQASAPSPSATASVLPKPVASP 

ASVDPAKLARAEAVVMEVLAAKTGYEVDMIDADMLLDAELGIDSVKRIEI 

LAAVQAQLGVEAKDVDALSRTRTVGEWEAMKAEIGAAGPNDAQAASG 

HLFGTGCEDLSLCSASWEIARCSELALERPMDRPILIVSDGSALPAALASRL 

GSCAVILTTAGETDQSVRSTKHVDMEGWGEADLVRALEAVESRFGVPGGV 

WLERASETARDQLGFALLLAKHSSKALNQQIPGGRACFVGVSRIDGKLGL 

SGACAKGKGWAEAAEIAQQGAVAGLCKTLDLEWPHVFARSIDIELGANE 

ETAAQAIFEELSCPDLTVREAGYTKDGKRWTTEARPVGLGKPKQALRSSDV 

FLVSGGARGITPVCVRELAKSISGGTFVLLGRSPLADDPAWACGVEEANIG 

TAAMAHLKAEFAAGRGPKPTPKAFDCALVGSVLGAREVLGSLESIRAQGA 

RAEYVSCDVSCAERVKAVVDDLERRVGAVTGVVHASGVLRDKSVERLELA 



DFEVWGTKVDGLLNLLQAVDRPKLRHLVLFSSLAGFHGNTGQAVYAMA 

NEALNKMAFHLETAMPGI^VIOIGFGPWDGGMVNDALKAPIFASMGVQI 

IPLDGGAETVSRIIGACSPTQVLVGNWGLPPWPNASVHKITVRLGGESAN 

PFLSSFfnQGRKVLPMTVALGLLAEAARGLYVGHQVVGIEDAQVFQGVVL 

DKGATCEVQLRRESSTASPSEWLSASLNVFAAGKWPAYRAHWLGASG 

PRTGGVQLELKDLGVDADPACSVGKGALYDGRTLFHGPAFQYMDEVLRC 

SPAELAVRCRWPSAAQDRGQFVSRGVLYDPFLNDTVFQALLVWARLVRD 

SASLPSNVERISFHGQPPSEGEVFYTTLKLDSAASGPLDPIAKAQFFLHRAC 
GAVFASGRASWLNKALSF 

SEQ ID NO:ll: 

QAIGHRAARWSCRSKSKARGHKAQKEMNQGGRNDEGVSVARADPCPDT 

RIAWGMAVEYAGCRGKEAFWDTLMNGKINSACISDDRLGSARREEHYA 

PERSKYADTFCNERYGCIDPKVDNEHDLLLGLAAAALQDAQDRRSDGGK 

FDPAQLKRCGrVSGCLSFPMDNLQGELLNLYQAHAERRIGKHCFADQTPW 

STRTRALHPLPGDPRTHRDPASFVAGQLGLGPLHYSLDAACASALYVLRL 

AQDHLLSGEADLMLCGATCFPEPFFILTGFSTFHAMPVGENGVSMPFHRD 

TQGLTPGEGGSVMVLKRLADAERDGDHTYGTLLGASLSNAGCGLPLKPHQ 

PSEEACLKATYELVGVPPRDVQYVECHATGTPQGDTVELQAVKACFEGAS 

PRIGSTKGNFGHTLVAAGFAGMCKVLLAMERGVIPPTPGVDSGTQIDPLV 

VTAALPWPDTRGGPKRAGLSAFGFGGTNAHAVFEEHIPSRAPPAVLCQPR 

LGSGPNRKLAIVGMDATFGSLKGLSALEAALYEARHAARPLPAKRWRFLG 

GDESFLFIEIGLECSPHGTOED\nDVDFKRLRTPMVPEDLLRPQQLLAVSTID 

KAILDSGLAKGGNVAVLVGLGTDLELYRHRARVALKERLQGLVRSAEGG 

ALTSRLMNYINDSGTSTSYTSYIGNLVATRVSSQWGFTGPSFTVTEGANSVH 

RCAQLAKYMLDRGEVDAVWAGVDLCGSAEAFFVRSRRMQISKSQRPAA 

PFDRAADGFFAGEGCGALWKRLTDCVSGERryASLDSVWATTPRAALRA 

AAGSARVDPASIDMVELSADSIiRFVRAPGTVAQPLTAEVEVGAVREVIGT 

AGRGSRSVAVGSVRANVGDAGFASGAAALVKTALCLHNRYLAATPGWD 



APAAGVDFGAELYVCRESRAWVKNAGVARHAAISGVDEGGSCYGLVLSD 

VPGQYETGNRISLQAESPKLLLLSAPDHAALLDKVAAELAALEQADGLSA 

AAAAVDRLLGESLVGCAAGSGGLTLCLVASPASLHKELALAHRGIPRCIK 

ARRDWASPAGSYFAPEPIASDRVAFMYGEGRSPYCGVGRDLHRIWPALHE 

RVNAKTVNLWGDGDAWLLPRATSAEEEEQLCRNFDSNQVEMFRTGVYIS 

MCLTDLARSLIGLGPKASFGLSLGEVSMLFALSESNCRLSEEMTRRLRASPV 

WNSELAVEFNALRKLWGVAPGAPVDSFWQGYWRATRAQVEQAIGEDN 

QFVRLLrVNDSQSVLIAGKPAACEAVIARIGSILPPLQVSQGMVGHCAEVLP 

YTSEIGRIHNMLRFPSQDETGGCKMYSSVSNSRIGPVEESQMGPGTELVFSPS 

MEDFVAQLYSRVADFPAITEAVYQQGHDVFVEVGPDHSRSAAVRSTLGPT 

RRFnAVAMDRKGESAWSQLLKMLATLASHRVPGLDLSSMYHPAWERCR 

LALAAQRSGQPEQRNKFLRTIEVNGFYDPADATIPEAVATILPATAAISPPK 

LGAPHDSQPEAEARPVGEASVPRRATSSSKLARTLAIDACDSDVRAALLDL 

DAPIAVGGSSRAQVPPCPVSALGSAAFRAAHGVDYALYMGAMAKGVASA 

EMVIAAGKARMLASFGAGGLPLGEVEEALDKIQAALPEGPFAVNLIHSPFD 

PNLEEGNVELFLRRGIRLVEASAFMSVTPSLVRYRVAGLERGPGGTARVLN 

RVIGKVSRAELAEMFMRPPPAArVSKLLAQGLVTEEQASLAErVPLVDDVAI 

EADSGGFTTDNRPIHWLPWLALRDRVMRECKYPAANRVRVGAGGGIGC 

PAAARAAFDMGAAFVLTGSINQLTRQAGTSDSVRAALARATYSDVTMAP 

AADMFDQGVKLQVLKRGTMFPARANKLYELFTTYQSLDAIPRAELARLEK 

RVFRMSIDEVWNETKQFYETRLNNPAKVARAERDPKLKMSLCFRWYLSKS 

SKWASTGQVGRELDYQVWCGPTIGAFNEFVKGSSLDAEACGGRFPCWRV 

NQEILCGAAYEQRLARFMLLAGRESADALAYTVAEAR 

SEQ ID NO:12: 

ATGGAGACAAAGGACGATCGCGTTGCGATCGTGGGCATGTCGGCCAT 
ACTGCCTTGCGGTGAGTCAGTGCGCGAGTCGTGGGAGGCGATTCGCG 
AGGGGCTCGATTGCCTGCAGGACCTGCCTGCGGACCGAGTCGATATC 
ACGGCGTACTACGACCCGAACAAGACAACCAAGGACAAGATCTACT 



GCAAGCGCGGCGGCTTCATTCCCGAGTATGACTTTGACGCGCGCGAGT 

TCGGCCTCAACATGTTCCAGATGGAGGACTCGGACGCCAACCAAACC 

GTGACTTTGCTCAAGGTCAAGGAGGCTCTCGAGGACGCCGGGGTGGA 

GCCCTTCACAAAGAAGAAGAAGAACATTGGCTGCGTGCTCGGCATCG 

GCGGCGGGCAGAAGGCGAGCCACGAGTTTTACTCCCGACTCAACTAT 

GTGGTCGTGGAGAAGGTGCTTCGCAAGATGAACCTCCCCGACGAGGT 

TGTCGAGGCCGCCGTCGAAAAGTACAAGGCCAACTTTCCTGAATGGC 

GCCTCGACTCGTTCCCTGGGTTTCTTGGCAACGTGACCGCCGGGCGGT 

GCAGCAACGTCTTCAACATGGAAGGCATGAACTGCGTCGTGGACGCT 

GCGTGCGCCAGCTCGCTCATCGCGATCAAGGTTGCCATTGATGAGCTC 

CTCCACGGGGACTGCGACACCATGATTGCCGGTGCGACCTGCACCGA 

CAACTCGATCGGGATGTACATGGCCTTTTCCAAAACCCCAGTTTTCTCC 

ACCGACCAGAGCGTCAAGGCGTACGACGCCAAGACGAAAGGCATGC 

TCATCGGCGAAGGCTCGGCCATGGTCGTGCTCAAGCGGTACGCGGAC 

GCCGTTCGGGATGGTGATGAGATCCATGCCGTCATCAGGGCATGCGCC 

TCGTCCAGCGACGGCAAGGCTGCTGGCATTTACGCACCGACGGTGTCG 

GGTCAAGAAGAGGCACTGCGGCGCGCGTACGCCCGAGCTGGCGTGGA 

CCCCTCCACCGTCACGCTGGTGGAGGGCCACGGCACTGGCACACCCG 

TCGGGGACCGGATTGAGCTGACCGCCTTGCGCAACGTCTTTGACGCAG 

CCAACAAAGGCCGCAAGGAAACAGTCGCGGTGGGAAGCATCAAGTC 

GCAGATCGGTCACCTGAAGGCCGTGGCCGGCTTTGCCGGTCTCGTCAA 

GGTTGTCATGGCCCTCAAGCACAAGACGCTGCCGCAGACCATCAACG 

TTCACGACCCGCCCGCACTGCACGACGGCTCGCCCATCCAGGATTCGA 

GTCTTTACATCAACACGATGAACCGGCCCTGGTTTACGGCACCTGGCG 

TCCCCCGCCGTGCAGGCATCTCTAGCTTTGGGTTTGGCGGCGCCAACT 

ACCACGCTGTTCTCGAAGAGGCCGAGCCTGAGCACGCGAAGCCGTAT 

CGCATGAACCAAGTTCCACAACCGGTGCTCTTGCACGCAAGCTCCGCG 

TCAGCTCTT 



SEQ ID NO:13: 

METKDDRVATVGMSAILPCGESVRESWEAIREGLDCLQDLPADRVDITAYY 
DPNRGGFIPEYDFDAREFGLNMFQMEDSDANQTVTLLKVKEALEDAGVEP 
FTK 

KKMSFIGCVLGIGGGQKASHEFy'SRLNYVVVEKVLRKMNLPDEVVEAAVEK 
YKANFPEWRLDSFPGFLGNV TAGRCSNVFN 
MEGMNCVVDAACASSLIAIK 

VAIDELLHGDCDTMIAGATCTDNSIGMYMAFSKTPVFSTDQSVKAYDAKT 
KGMLIGEGSAMVVLKRYADAVRDGDEIHAVIRACASSSDGKAAGIYAPTV 
SGQEEALRRAYARAGVDPSTVTLVEGHGTGTPVGDRIELTALRNVFDAAN 
KGRKETVAVG SIKSQIGHLK 
AVAGFAGLVKWMALKFDCTLPQTINVFIDPP 

ALFTOGSPIQDSSLY1NTMNRPWFTAPGVPRRAGISSFGFGGANYFLAVLEE 
AEPEHAKPYRMNQVPQPVLLHASSASAL 

SEQ ID NO:14: 

CAGTCGAGTGCGACGCTCGAATGGACCCTGCTCCGCGAGGGCGTCAC 

GTACCGCTCCGCCGCGATGCACACTCCTGGCAGTGTCGCTGCTCTGTTT 

GCCGGGCAAGGCGCGCAGTACACGCACATGTTCGCTGACGTTGCCAT 

GAACTGGCCACCGTTTCGAAGCGCCGTGCAAGAGATGGATGCCGCTC 

AAGTCACGGCGGCAGCGCCGAAGCGCCTCAGCGAGGTCCTGTATCCG 

CGCAAGCCGTACGCTGCAGAGCCCGAGCAAGACAACAAGGCCATCTC 

GATGACGATTAACTCGCAACCGGCCCTCATGGCCTGCGCTGCTGGGGC 

GTTTGAGGTGTTTCGTCAAGCTGGTCTTGCGCCCGACCACGTCGCGGG 

TCATTCTCTCGGCGAGTTTGGTGCTTTGCTCGCCGCTGGATGCGCAAGC 

CGTGAGGAGCTCTTCCGTCTGGTCTGCAGCAGAGCGAAGGCAATGCA 

AGACGTTCCCAAGCCAAGCGAGGGCGTCATGGCAGCTGTCATCGGCC 

GTGGTGCTGACAAGCTCACGCTGCAAGGCGATGGTGCGTGGCTTGCCA 

ACTGCAACTCGCCAAGCCAAGTGGTCATTTCCGGCGACAAGACTGCT 



GTCGAGCGTGAATCCAGCCGGTTGGCAGGCCTTGGCTTCAGGATCATT 

CCGCTTGCATGCGAAGGCGCCTTCCATTCACCGCACATGACGGCGGCC 

CAGGCCACGTTTCAGGCTGCACTGGACAGCCTCAAGATCTCCACCCCG 

ACGAACGGGGCGCGCCTGTACAACAACGTTTCCGGAAAGACCTGCCG 

ATCCCTGGGTGAACTCCGCGACTGCCTGGGCAAGCACATGACAAGTC 

CTGTGCTCTTCCAGGCACAGGTAGAGAACATGTACGCTGCCGGGGCG 

CGCATTTTCGTGGAGTTTGGCCCGAAGCAAGTCCTCTCCAAGCTCGTA 

GGCGAGATTCTCGCCGACAAGTCAGACTTTGTGACAGTCGCGGTCAAC 

TCGTCATCGTCCAAGGACAGCGACGTGCAACTTCGTGAAGCTGCTGCG 

AAGCTCGCGGTCCTTGGCGTCCCGTTGGCGAACTTTGACCCTTGGGAG 

CTCTGCGACGCGCGGCGTCTTCGCGAATGCCCGCGATCCAAGACGAC 

GTTGCGCTTGTCTGCAGCGACCTACGTGTCGAACAAGACCCTTGCTGC 

TAGGGAGAAGGTCATGGAGGACAACTGCGACTTTTCTTCGCTCTTTGC 

CTCCGGTCCAGCAAGCCAAGAGATGGAGCGAGAAATAGCCAACCTTC 

GCGCTGAGCTGGAGGCGGCCCAACGCCAGCTTGACACGGCCAAA 

SEQ ID NO:15: 

QSSATLEWTLLREGVTYRSAAMHTPGSVAALFAGQGAQYTHMFADVAM 

NWPPFRSAVQEMDAAQVTAAAPKRLSEVLYPRKPYAAEPEQDNKAISMTI 

NSQPALMACAAGAFEVFRQAGLAPDHVAGHSLGEFGALLAAGCASREEL 

FRLVCSRAKAMQDVPKPSEGVMAAVIGRGADKLTLQGDGAWLANCNSP 

SQWISGDKTAVERESSRLAGLGFRIIPLACEGAFHSPHMTAAQATFQAAL 

DSLKISTPTNGARLYNNVSGKTCRSLGELRDCLGKHMTSPVLFQAQVENM 

YAAGARIFVEFGPKQVLSKLVGEILADKSDFVTVAVNSSSSKDSDVQLREA 

AAKLAVLGVPLANFDPWELCDARRLRECPRSKTTLRLSAATYVSNKTLAA 

REKVMEDNCDFSSLFASGPASQEMEREIANLRAELEAAQRQLDTAK 



SEQ ID NO:16: 



CAAGTCACTTCCGCTCCCATCGCCGAGCTCGCGCGCGCCGAGGCCGTC 

GTCATGGAGGTTCTCGCTGCCAAGACTGGCTACGAGGTCGACATGATC 

GAGGCCGACATGCTGCTCGACGCCGAGCTCGGCATCGACTCGGTCAA 

GCGCATTGAGATCCTGGCAGCTGTCCAGGCCCAGCTCGGGGTCGAGG 

CCAAGGACGTCGACGCGCTCAGCCGCACACGAACAGTTGGCGAGGTC 

GTTGACGCCATGAAGGCTGAGATCGGCGGG 

SEQIDNO:17: 

QVTSAPIAELARAEAVVMEVLAAKTGYEVDMIEADMLLDAELGIDSVKRIE 
ILAAVQAQLGVEAKDVDAL SRTRTVGEWDAMKAEIGG 

SEQ ID NO:18: 

CATCTCTTTGGCACGGGATGTGAAGACCTGAGCCnTGCTCTGCTTCTG 

TGGTTGAGATTGCTCGTTGCAGCGAACTAGCTCTGGAGCGCCCGATGG 

ATCGGCCCATTCTTATTGTAAGCGATGGATCAGCATTGCCGGCGGCTC 

TGGCTAGTCGACTGGGGTCGTGTGCAGTAATCCTCACGACCGCAGGCG 

AGACCGACCAATCTGTGCGCTCGACGAAGCACGTTGACATGGAAGGG 

TGGGGCGAGGCAGATCTCGTGCGCGCTCTTGAAGCAGTAGAGTCTCG 

ATTCGGCGTCCCAGGCGGCGTCGTGGTGCTTGAGCGCGCCTCAGAAAC 

AGCTAGGGACCAGCTTGGCTTTGCCCTGCTGCTTGCCAAGCATTCGAG 

CAAAGCGCTCAACCAGCAGATCCCAGGCGGGCGCGCCTGCTTCGTGG 

GCGTCTCGCGAATCGACGGAAAGCTCGGACTTAGCGGAGCTTGCGCG 

AAAGGAAAGGGCTGGGCTGAGGCCGCAGAGATTGCTCAGCAAGGAG 

CCGTCGCGGGCTTGTGCAAGACCTTGGACCTAGAGTGGCCGCACGTCT 

TCGCTCGCAGCATCGACATCGAGCTTGGCGCGAACGAAGAAACAGCT 

GCGCAAGCAATCTTTGAGGAGCTCTCTTGCCCGGACCTAACGGTGCGC 

GAAGCAGGATACACCAAAGACGGCAAGCGGTGGACGACTGAGGCGC 

GACCGGTTGGGCTTGGCAAGCCCAAGCAGGCACTACGTTCTTCGGAC 

GTCTTCTTGGTTTCTGGTGGGGCGCGGGGAATTACACCTGTTTGCGTTC 



GCGAGTTGGCCAAATCGATCAGTGGTGGCACTTTTGTCCTCCTCGGGC 

GGTCCCCTCrCGCTGATGATCCGGCGTGGGCTTGCGGCGTCGAGGAAG 

CAAACATTGGGACAGCCGCTATGGCGCACCTCAAGGCCGAGTTCGCA 

GCCGGGCGCGGCCCGAAGCCGACGCCAAAGGCCCACAAAGCACTCG 

TTGGGAGCGTCCTGGGGGCGCGCGAAGTCCTTGGTTCGCTAGAGAGTA 

TTCGCGCCCAGGGTGCGCGCGCCGAGTACGTTTCCTGCGACGTTTCGT 

GTGCGGAGCGCGTCAAGGCCGTCGTCGACGATCTCGAGCGACGGGTC 

GGGGCTGTAACTGGGGTTGTGCACGCCTCTGGTGTTCTCCGAGACAAG 

TCCGTTGAGCGCTTGGAGCTCGCCGACTTCGAGGTCGTGTACGGCACC 

AAGGTGGACGGCCTGCTCAACCTGCTGCAGGCCGTGGACCGCCCCAA 

ACTCCGGCACTTGGTCCTCTTCAGCTCCCTGGCCGGTTTCCACGGCAAC 

ACTGGGCAGGCCGTGTACGCTATGGCGAATGAGGCGCTGAACAAGAT 

GGCCTTCCATTTGGAAACTGCGATGCCTGGCCTCTCGGTCAAGACGAT 

CGGGTTTGGACCTTGGGACGGCGGCATGGTCAACGATGCGCTGAAAG 

CGCACTTTGCGTCTATGGGCGTCCAAATTATTCCGCTCGACGGCGGCG 

CGGAGACCGTTTCCCGAATCATCGGGGCGTGCTCGCCAACACAAGTTC 

TGGTTGGCAACTGGGGCTTGCCCCCTGTAGTTCCTAACGCGAGCGTGC 

ACAAGATTACTGTGAGGCITGGCGGGGAGTCTGCAAACCCTTTCCTGT 

CCTCGCACACGATTCAAGGCAGAAAGGTCTTGCCGATGACTGTGGCG 

CTTGGGCTTCTCGCTGAGGCGGCTCGAGGGCTCTACGTCGGTCACCAA 

GTAGTCGGGATTGAGGACGCCCAAGTCTTCCAGGGAGTCGTGTTGGAC 

AAAGGGGCGACGTGTGAGGTCCAGCTTCGCCGCGAGTCTTCGACTGC 

AAGCCCAAGCGAGGTTGTGCTGAGTGCTTCGCTCAATGTATTCGCGGC 

GGGAAAGGTTGTGCCTGCGTACCGCGCGCATGTCGTGCTCGGCGCTTC 

AGGGCCACGCACTGGCGGCGTGCAGCTTGAACTGAAAGATTTGGGCG 

TGGACGCCGACCCTGCTTGCTCCGTTGGCAAGGGTGCGCTGTACGACG 

GTAGGACGCTGTTCCATGGGCCGGCGTTTCAGTACATGGATGAGGTTC 

TTCGGTGCTCGCCTGCAGAGCTTGCCGTGCGGTGCCGTGTCGTTCCGA 

GCGCGGCTCAGGACCGCGGCCAATTTGTTTCGCGCGGAGTGTTGTACG 



ACCCGTTCCTGAACGACACGGTGTTTCAAGCTCTCCTTGTTTGGGCCCG 

TCTGGTCAGGGACAGCGCTTCGCTACCGAGCAACGTTGAACGAATCTC 

GTTCCACGGCCAGCCGCCGAGCGAGGGCGAGGTGTTTTACACCACGC 

TCAAGCTGGACAGTGCTGCGAGCGGGCCGCTCGACCCGATTGCAAAG 

GCGCAGTTCTTCCTCCACCGAGCTTGCGGGGCGGTCTTTGCATCAGGG 

CGAGCGAGTGTGGTTCTGAACAAGGCTCTTTCGTTT 

SEQ ID NO:19: 

ASGHLFGTGCEDLSLCSASWEIARCSELALERPMDRPILIVSDGSALPAAL 

ASRLGSCAVILTTAGETDQSVRSTKHVDMEGWGEADLVRALEAVESRFGV 

PGGVWLERASETARDQLGFALLLAKHSSKALNQQIPGGRACFVGVSRIDG 

KLGLSGACAKGKGWAEAAEIAQQGAVAGLCKTLDLEWPHVFARSIDIEL 
GANEETAA 

QAIFEELSCPDLTVREAGYTKDGKRWTTEARPVGLGKPKQALRSSDVFLV 

SGGARGITPVCVRELAKSISGGTFVLLGRSPLADDPAWACGVEEANIGTA 

AMAHLKAEFAAGRGPKPTPKAHKALVGSVLGAREVLGSLESIRAQGARA 
E 

YVSCDVSCAERVKAWDDLERRVGAVTGWHASGVLRDKSVERLELADFE 

VWGTKVDGLLNLLQAVDRPKLRHLVLFSSLAGFHGNTGQAVYAMANE 
AL 

NKMAFHLETAMPGl^VKTIGFGPWDGGMVNDALKAHFASMGVQnPLDG 
G 

AETVSRIIGACSPTQVLVGNWGLPPVVPNASVHKITVRLGGESANPFLSS 

HTIQGRKVLPMTVALGLLAEAARGLYVGHQWGIEDAQVFQGWLDKGA 
T 

CEVQLRRESSTASPSEVVLSASLNVFAAGKVVPAYRAFJVVLGASGPRTGG 

VQLELKDLGVDADPACSVGKGALYDGRTLFHGPAFQYMDEVLRCSPAEL 
A 

VRCRWPSAAQDRGQFVSRGVLYDPFLNDTVFQALLVWARLVRDSASLPS 



NVERISFHGQPPSEGEVFYTTLKLDSAASGPLDPIAKAQFFLHRACGAVF 
ASGRASWLNKALSF 



SEQ ID NO:20: 

ATGAACCAGGGCGGGAGAAATGACGAGGGCGTCTCGGTGGCGCGCGCG 

GACCCATGCCCTGACACGCGGATCGCTGTCGTGGGCATGGCGGTCGAGTA 

TGCAGGGTGCCGCGGCAAGGAAGCGTTCTGGGACACGCTCATGAACGGC 

AAAATCAACTCTGCCTGTATCTCAGACGATCGCCTCGGGTCAGCACGACG 

AGAAGAGCACTATGCGCCCGAGAGGTCAAAGTACGCCGATACGTTCTGC 

AACGAGAGGTACGGATGCATCGATCCCAAAGTCGACAACGAGCACGAC 

CTGCTCCTCGGCCTCGCCGCGGCTGCGCTTCAAGACGCGCAGGACAGGCG 

CAGCGACGGCGGCAAGTTCGACCCAGCGCAGCTCAAGCGCTGCGGCATT 

GTCAGCGGCTGCCTGTCCTTCCCGATGGACAACCTGCAAGGCGAGCTGCT 

CAACCTTTACCAAGCCCATGCTGAGAGGCGGATTGGCAAGCATTGCTTCG 

CGGACCAAACGCCCTGGTCGACGCGAACCAGAGCGCTTCACCCGCTGCC 

CGGGGACCCGAGGACCCACCGCGACCCAGCCTCCTTCGTCGCCGGACAG 

CTCGGCCTCGGCCCGCTGCACTACTCGCTCGACGCCGCCTGCGCCTCGGC 

CCTTTACGTTCTGCGACTCGCTCAGGACCACCTCCTCTCGGGCGAGGCTG 

ACTTGATGCTGTGCGGAGCGACGTGCTTCCCAGAGCCCTTCTTCATCCTGA 

CTGGGTTTAGCACGTTCCACGCGATGCCAGTCGGTGAGAACGGTGTCTCG 

ATGCCGTTTCATCGGGACACGCAAGGGCTGACGCCCGGCGAGGGCGGCT 

CGGTGATGGTGCTCAAGCGCCTCGCGGACGCCGAGCGCGACGGAGACCA 

CATCTACGGGACGCTTCTTGGAGCCAGCTTGAGCAACGCAGGCTGCGGG 

CTTCCTCTCAAGCCGCACCAGCCAAGCGAGGAGGCCTGCTTGAAAGCCA 

CCTACGAGCTCGTCGGCGTGCCGCCCCGAGACGTCCAGTACGTCGAGTGC 

CACGCCACCGGCACGCCGCAGGGCGACACCGTCGAGCTCCAAGCCGTCA 

AAGCCTGCTTTGAGGGCGCAAGCCCCCGGATCGGGTCCACGAAAGGCAA 

CTTCGGACACACCCTCGTCGCGGCCGGCTTTGCGGGAATGTGCAAGGTTC 

TCCTTGCAATGGAGCGCGGCGTGATCCCCCCGACCCCGGGCGTTGACTCT 



GGCACCCAGATTGATCCCCTCGTCGTCACAGCGGCGCTCCCGTGGCCGGA 

TACGCGCGGCGGGCCGAAACGCGCAGGACTCTCCGCATTCGGATTCGGG 

GGCACAAACGCGCACGCCGTCTTTGAGGAGCATATTCCCTCGAGAGCT 



SEQ ID NO: 21: 

MNQGGRNDEGVSVARADPCPDTRIAVVGMAVEYAGCRGKEAFWDTLMNG 

KINSACISDDRLGSARREEHYAPERSKYADTFCNERYGCIDPKVDNEHDLLLG 

LAAAALQDAQDRRSDGGKFDPAQLKRCGIVSGCLSFPMDNLQGELLNLYQA 

HAERRIGKHCFADQTPWSTRTRALHPLPGDPRTHRDPASFVAGQLGLGPLHY 

SLDAACASALYVLRLAQDHLLSGEADLMLCGATCFPEPFFILTGFSTFHAMPV 

GENGVSMPFHRDTQGLTPGEGGSVMVLKRLADAERDGDFHYGTLLGASLSN 

AGCGLPLKPHQPSEEACLKATYELVGVPPRDVQYVECHATGTPQGDTVELQA 

VKACFEGASPRIGSTKGNFGHTLVAAGFAGMCKVLLAMERGVTPPTPGVDSG 

TQIDPLWTAALPWPDTRGGPKRAGLSAFGFGGTNAHAVFEEHIPSRA 

SEQ ID NO: 22: 

CAGCCTCGCCTCGGCAGCGGACCAAACCGAAAGCTTGCTATCGTCGGCA 

TGGATGCCACGTTTGGATCCTTGAAGGGTCTCTCCGCACTAGAAGCTGCG 

CTTTACGAGGCAAGGCACGCTGCGCGGCCCCTGCCTGCGAAGCGCTGGC 

GCTTCTTGGGCGGGGACGAGTCCTTTCTCCACGAGATCGGACTCGAGTGC 

TCTCCGCACGGGTGCTACATTGAGGACGTGGATGTGGACTTTAAGCGACT 

CCGCACGCCAATGGTGCCGGAGGACTTGCTCCGGCCGCAACAGCTCCTG 

GCCGTGTCGACGATTGACAAGGCCATCCTCGACTCGGGCTTGGCCAAGG 

GCGGCAACGTGGCTGTCCTTGTCGGCCTCGGGACGGACCTCGAGCTCTAC 

CGCCACCGAGCTCGGGTTGCGCTTAAGGAGCGTCTTCAAGGACTGGTTCG 

CTCTGCCGAGGGAGGAGCCCTGACGTCTCGCCTGATGAACTATATCAATG 

ATAGCGGAACGTCGACCTCCTACACGTCGTATATCGGCAACCTCGTCGCC 

ACGCGCGTCTCGTCCCAGTGGGGCTTCACTGGGCCGTCGTTCACCGTCAC 

GGAAGGGGCCAACTCGGTCCATCGGTGCGCCCAGCTCGCCAAGTACATG 



CTCGACCGCGGCGAGGTCGACGCCGTCGTGGTTGCAGGAGTCGACCTGTG 

CGGGAGCGCCGAGGCGTTCTTCGTGAGGTCGCGCCGCATGCAGATCTCGA 

AAAGTCAGCGCCCGGCCGCGCCGTTTGACCGCGCCGCAGACGGCTTCTTC 

GCGGGGGAAGGGTGCGGCGCCCTCGTCTTCAAACGCCTGACTGACTGTGT 

GTCTGGCGAGCGAATCTACGCGTCCCTCGACTCGGTCGTCGTCGCAACCA 

CGCCGCGCGCCGCTCTTCGTGCTGCCGCAGGGTCGGCGCGGGTTGACCCA 

GCCAGCATCGACATGGTCGAGCTGAGCGCAGATTCCCACCGGTTTGTGCG 

GGCGCCAGGCACCGTGGCTCAGCCTCTGACAGCCGAAGTCGAGGTCGGG 

GCGGTGCGGGAAGTGATCGGGACCGCGGGGAGGGGCTCTCGAAGCGTGG 

CCGTCGGATCGGTCCGCGCCAACGTCGGGGACGCAGGGTTTGCTTCCGGG 

GCCGCTGCCCTCGTAAAAACTGCGCTCTGCTTGCACAACCGCTACTTGGC 

GGCTACCCCAGGCTGGGATGCGCCTGCTGCCGGCGTGGATTTTGGTGCCG 

AGCTGTACGTTTGCCGCGAGTCGCGTGCTTGGGTCAAGAACGCCGGCGTT 

GCACGGCACGCCGCAATTTCTGGCGTGGACGAAGGCGGGTCG 

SEQ ID NO:23: 

QPRLGSGPNRKLAIVGMDATFGSLKGLSALEAALYEARHAARPLPAKRWRFL 

GGDESFLHEIGLECSPHGCYIED\T>VDFKRLRTPMVPEDLLRPQQLLAVSTTDK 

AILDSGLAKGGNVAVLVGLGTDLELYRHRARVALKERLQGLVRSAEGGALTS 

RLMNYINDSGTSTSYTSYIGNLVATRVSSQWGFTGPSFTVTEGANSVHRCAQL 

AKYMLDRGEVDAVWAGVDLCGSAEAFFVRSRRMQISKSQRPAAPFDRAAD 

GFFAGEGCGALVFKRLTDCVSGERTVASLDSVWATTPRAALRAAAGSARVDP 

ASIDMVELSADSHRFVRAPGTVAQPLTAEVEVGAVREVIGTAGRGSRSVAVGS 

VRANVGDAGFASGAAALVKTALCLHNRYLAATPGWDAPAAGVDFGAELYV 

CRESRAWVKNAGVARHAAISGVDEGGS 

SEQ ID NO:24: 

TGCTATGGGCrGGTTCTTTCGGACGTGCCTGGGCAGTACGAGACCGGCAA 
CCGCATCTCCCTCCAGGCCGAGTCGCCCAAGCTCTTGCTCCTCTCGGCTCC 



AGACCACGCCGCCTTGCTGGACAAGGTGGCGGCCGAGCTCGCAGCCCTT 

GAGCAAGCCGACGGCTTGAGCGCCGCCGCGGCTGCCGTAGACCGCTTAC 

TCGGCGAGTCGCTCGTCGGTTGCGCGGCTGGCAGCGGCGGGCTGACCCTT 

TGCTTGGTGGCTTCGCCTGCCAGCCTCCACAAGGAGCTTGCGCTGGCCCA 

TCGAGGGATCCCGCGCTGCATCAAAGCACGGCGCGACTGGGCCAGCCCG 

GCAGGGAGCTACTTCGCCCCGGAGCCGATCGCAAGCGACCGCGTCGCGT 

TCATGTACGGGGAAGGACGAAGCCCGTACTGCGGCGTCGGCCGCGACCT 

CCACCGGATCTGGCCCGCGCTGCATGAGCGGGTGAACGCCAAGACTGTC 

AACCTCTGGGGTGACGGTGACGCCTGGCTGCTGCCACGTGCAACCTCGGC 

CGAGGAAGAGGAGCAACTCTGCCGCAACTTCGACTCGAACCAGGTTGAG 

ATGTTTCGAACGGGCGTGTACATCTCGATGTGCTTGACCGACCTCGCTCG 

AAGCTTGATTGGACTGGGCCCTAAGGCGAGCTTTGGGCTCAGCCTAGGCG 

AGGTTTCCATGCTCTTCGCTCTGAGCGAGTCCAACTGTAGACTGTCGGAG 

GAAATGACCCGCAGGCTCCGTGCGTCCCCGGTGTGGAACTCGGAGCTCG 

CCGTCGAGTTCAACGCCCTTCGAAAGTTGTGGGGGGTCGCGCCGGGGGC 

ACCCGTCGACTCGTTCTGGCAAGGTTATGTCGTGCGCGCAACGCGGGCTC 

AGGTGGAGCAAGCCATTGGGGAGGACAATCAGTTTGTGCGTCTCCTGATC 

GTGAACGACTCGCAATCAGTCCTGATCGCCGGCAAGCCGGCGGCGTGCG 

AAGCCGTAATTGCTCGCATCGGGTCTATTCTTCCCCCGCTGCAAGTGTCGC 

AAGGCATGGTGGGGCACTGTGCCGAGGTCTTGCCGTACACGAGCGAGAT 

CGGGCGCATCCACAACATGCTTCGCTTCCCATCGCAGGACGAAACGGGC 

GGTTGCAAAATGTACTCTAGCGTCTCAAACTCGCGCATCGGGCCAGTCGA 

GGAGAGCCAGATGGGCCCAGGCACTGAGCTCGTTTTCTCGCCGTCAATGG 

AAGACTTTGTCGCCCAGCTGTACTCGCGAGTTGCAGACTTTCCGGCGATC 

ACCGAGGCGGTTTACCAGCAGGGTCATGACGTGTTTGTCGAAGTGGGGCC 

GGACCATTCACGGTCGGCTGCTGTCCGCTCCACGCTTGGACCCACTCGGC 

GACACATCGCTGTGGCGATGGACCGCAAGGGTGAGTCAGCTTGGTCGCA 

GCTTCTGAAAATGCTGGCTACGCTTGCGTCGCACCGCGTGCCGGGCCTG 



SEQ ID NO:25: 

CYGLVLSDVPGQYETGNRISLQAESPKLLLLSAPDHAALLDKVAAELAALEQA 

DGLSAAAAAVDRLLGESLVGCAAGSGGLTLCLVASPASLHKELALAHRGIPR 

CIKARRDWASPAGSYFAPEPIASDRVAFMYGEGRSPYCGVGRDLHRIWPALHE 

RVNAKTVNLWGDGDAWLLPRATSAEEEEQLCRNFDSNQVEMFRTGVYISMC 

LTDLARSLIGLGPKASFGLSLGEVSMLFALSESNCRLSEEMTRRLRASPVWNSEL 

AVEFNALRKLWGVAPGAPVDSFWQGYVVRATRAQVEQAIGEDNQFVRLLIV 

NDSQSVUAGKPAACEAVIARIGSILPPLQVSQGMVGHCAEVLPYTSEIGRIHN 

MLRFPSQDETGGCKMYSSVSNSRIGPVEESQMGPGTELVFSPSMEDFVAQLYSR 

VADFPAITEAVYQQGHDVFVEVGPDHSRSAAVRSTLGPTRRHIAVAMDRKGE 

SAWSQLLKMLATLASHRVPGL 

SEQ ID NO:26: 

GCGACCATCCCTGAGGCCGTCGCAACAATTCTGCCGGCAACTGCTGCGAT 

1TCGCCTCCAAAGCTTGGCGCTCCGCACGACTCGCAACCCGAGGCGGAG 

GCTCGCCCCGTGGGCGAGGCCTCTGTGCCAAGGCGGGCCACGAGCTCGA 

GCAAATTGGCCAGGACGCTTGCCATCGATGCTTGCGACTCCGACGTGCGC 

GCCGCCTTGCTGGACCTGGACGCGCCAATCGCGGTCGGCGGCTCCTCGCG 

CGCCCAAGTCCCGCCGTGCCCAGTGAGCGCGCTCGGAAGCGCCGCCTTTC 

GAGCGGCACACGGCGTCGATTATGCGCTCTACATGGGCGCAATGGCCAA 

AGGCGTCGCGTCAGCGGAGATGGTCATCGCTGCTGGCAAGGCCCGCATG 

CTCGCGTCATTTGGCGCGGGGGGGCTTCCCCTGGGCGAGGTCGAAGAGGC 

GTTGGACAAGATCCAGGCCGCTCTGCCCGAGGGGCCGTTCGCCGTCAACC 

TCATTCACTCGCCGTTCGATCCAAACCTTGAGGAGGGCAACGTCGAGCTG 

TTCCTGAGGCGCGGTATCCGGCTGGTCGAGGCCTCTGCGTTCATGTCGGTC 

ACGCCGTCGTTGGTGCGCTACCGAGTCGCCGGACTCGAGCGAGGCCCTG 

GCGGGACCGCCCGAGTGCTGAACCGCGTGATTGGCAAGGTGAGCCGTGC 

GGAGCTCGCAGAAATGTTTATGCGGCCGCCTCCCGCCGCGATCGTCTCCA 

AGCTCCTCGCCCAGGGCCTGGTCACTGAGGAGCAGGCGTCACTTGCAGA 



GATCGTCCCACTGGTTGACGACGTTGCAATCGAAGCCGACTCGGGCGGTC 

ACACAGACAACCGCCCGATCCACGTCGTTTTGCCCGTCGTCCTCGCGCTG 

CGAGACCGCGTCATGCGTGAGTGCAAGTATCCAGCCGCCAATCGCGTCC 

GCGTGGGCGCCGGAGGCGGGATCGGCTGCCCTGCCGCGGCGCGAGCTGC 

GTTCGACATGGGCGCAGCATTCGTTCTCACGGGCTCGATCAACCAGCTCA 

CGCGCCAGGCTGGGACGAGCGACAGCGTGCGTGCTGCCCTTGCACGCGC 

GACCTACTCGGACGTGACAATGGCCCCGGCGGCCGATATGTTTGACCAG 

GGCGTCAAGCTGCAGGTCTTGAAGCGCGGCACGATGTTCCCGGCGCGCG 

CAAACAAGCTGTACGAGTTGTTCACCACTTACCAGTCGCTGGACGCGATC 

CCTCGGGCTGAGCTGGCTCGCCTGGAAAAGCGAGTTTTCCGCATGTCCAT 

CGACGAGGTTTGGAACGAAACCAAGCAGTTCTACGAGACCCGGCTCAAC 

AACCCCGCCAAGGTTGCCCGGGCGGAGCGCGACCCCAAGCTCAAGATGT 

CGCTCTGCTTTCGGTGGTACTTGTCGAAAAGCTCCAAGTGGGCATCGACT 

GGACAAGTTGGGCGCGAGCTGGACTACCAGGTCTGGTGCGGCCCCACGA 

TTGGCGCTTTCAACGAGTTCGTGAAGGGGTCCAGCCTCGACGCGGAGGCT 

TGCGGGGGGCGGTTTCCTTGCGTTGTGCGCGTTAACCAGGAGATATTATG 

TGGCGCTGCTTACGAGCAGCGACTGGCGCGTTTCATGCTGCTCGCTGGCC 

GGGAAAGCGCGGACGCGTTGGCGTACACGGTTGCGGAAGCCAGATAG 

SEQ ID NO:27: 

AHPEAVATILPATAAISPPKLGAPHDSQPEAEARPVGEASVPRRATSSSKLART 

LAIDACDSDVRAALLDLDAPIAVGGSSRAQVPPCPVSALGSAAFRAAHGVDY 

ALYMGAMAKGVASAEMVIAAGKARMLASFGAGGLPLGEVEEALDKIQAALP 

EGPFAVNLIHSPFDPNLEEGNVELFLRRGIRLVEASAFMSVTPSLVRYRVAGLE 

RGPGGTARVLNRVIGKVSRAELAEMFMRPPPAArVSKLLAQGLVTEEQASLAE 

rvTPLVDDVAIEADSGGHTDNRPIHVVLPVVLALRDRVMRECKYPAANRVRVG 

AGGGIGCPAAARAAFDMGAAFVLTGSINQLTRQAGTSDSVRAALARATYSDV 

TMAPAADMFDQGVKLQVLKRGTMFPARANKLYELFTTYQSLDAIPRAELARL 

EKRWRMSIDEVWNETKQFYETRLNNPAKVARAERDPKLKMSLCFRWYLSKS 



SKWASTGQVGRELDYQVWCGPTIGAFNEFVKGSSLDAEACGGRFPCWRVN 
QEILCGAAYEQRLARFMLLAGRESADALAYTVAEAR 



